19. Notebook + Quiz: Multicollinearity & VIFs

Workspace

This section contains either a workspace (it can be a Jupyter Notebook workspace or an online code editor work space, etc.) and it cannot be automatically downloaded to be generated here. Please access the classroom with your account and manually download the workspace to your local machine. Note that for some courses, Udacity upload the workspace files onto https://github.com/udacity , so you may be able to download them there.

Workspace Information:

  • Default file path:
  • Workspace type: jupyter
  • Opened files (when workspace is loaded): n/a

Based on the scatterplot matrix in the first question, select all the below statements that are true.

SOLUTION:
  • It appears that the predictor variables are correlated with one another.
  • The variables that appear to be most correlated are the number of bedrooms and bathrooms.

Select all that are true about the coefficients in your multiple linear regression model.

SOLUTION:
  • As the number of bathrooms increases, we predict the price to increase.
  • As the area of the home increases, we predict the price to increase.

Which one of the statements below reflects the action that should be taken based on the VIFs?

SOLUTION: We should remove either bedrooms or bathrooms, because they both have VIFs greater than 10.

Mark all of the statements below that are true with regard to your final results.

SOLUTION:
  • All VIFs are now below 10.
  • All of the coefficients are now positive, as we would expect.
  • To three digits the Rsquared value stayed the same, suggesting we didn't really need both bedrooms and bathrooms in the model.